Soundspotter / remix-TV: Fast Approximate Matching for audio and video Performance

نویسندگان

  • Michael A. Casey
  • Mick Grierson
چکیده

SoundSpotter is an open source software system for real-time matching of an audio input stream to a database of continuous audio or video. Among its novel features are real-time control over audio segmentation, feature selection and match radius. The system uses audio input to control selection of output from a database using similarity-based matching. The low latency methods employed create a feedback loop between the performer and the database, thus it is a type of electronic musical instrument. We employ exact nearest neighbor searching on variable-length sequences of audio features for matching. The feature space is controlled by range selection over the vector dimensions determining which are left out of similarity calculations. The current implementation is capable of real-time matching in tens of hours of audio or video on current laptop hardware. Finally, we describe SoundSpotter’s video editing utility in an application called REMIX-TV.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Histogram Algorithm for Fast Audio Retrieval

This paper describes a fast audio detection method for specific audio retrieval in the AV stream. The method is a histogram matching algorithm based on structural and perceptual features. This algorithm extracts audio features based on human perception on the sound scene and locates the special audio clip by fast histogram matching. Experimental results based on the advertisement detection in T...

متن کامل

Commercial Break Detection and Content Based Video Retrieval

This chapter presents a novel approach for automatic annotation and content based video retrieval by making use of the features extracted during the process of detecting commercial boundaries in a recorded Television (TV) program. In this approach, commercial boundaries are primarily detected using audio and the detected boundaries are validated and enhanced using splash screen of a program in ...

متن کامل

From Video and Audio Recurrences to Unsupervised Program Structuring

This paper addresses the problem of unsupervised TV programs structuring. Program structuring allows direct and non linear access to the desired parts of a program. Our work addresses the structuring of recurrent TV programs like news, entertainment programs, TV shows, TV magazines... In our previous work [1] we proposed a program structuring method based on the detection of video recurrences. ...

متن کامل

Matching Content to the Mobile User Smart Recommendations for Pervasive TV and Video

This publication presents our work on recommender systems for mobile audio-visual content. Our approach generates recommendations for media by extracting metadata and matching it with user-centric criteria such as mood preferences. We address the specific issues arising from mobility such as the need to minimize CPU-load, interaction complexity, as well as learning effort required from the user...

متن کامل

Simulation Model for Evaluation of the DVB-SH-A Performance

Today, mobile phone or other handheld device owners expect also receiving and watching a TV stream besides the classical voice or text services. The Digital Video Broadcasting – Satellite services to Handheld (DVB-SH) is designed to transport mobile TV services. It also supports a wide range of mobile multimedia services, e.g. audio and data broadcast as well as file download services [1], [2].

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007